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pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found 
in ETSI SR 000 314: "Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in 
respect of ETSI standards", which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web 
server ( http://webapp.etsi.org/IPR/home.asp ). 

Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee 
can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web 
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Foreword 

This Technical Specification (TS) has been produced by ETSI 3rd Generation Partnership Project (3GPP). 

The present document may refer to technical specifications or reports using their 3GPP identities, UMTS identities or 
GSM identities. These should be interpreted as being references to the corresponding ETSI deliverables. 

The cross reference between GSM, UMTS, 3GPP and ETSI identities can be found under 
http://webapp.etsi.org/kev/quervform.asp . 



ETSI 



3GPPTS 26.173 version 6.1.0 Release 6 3 ETSI TS 126 173 V6.1.0 (2006-12) 



Contents 



Intellectual Property Rights 2 

Foreword 2 

Foreword 4 

1 Scope 5 

2 References 5 

3 Definitions and abbreviations 5 

3.1 Definitions 5 

3.2 Abbreviations 5 

4 C code structure 6 

4.1 Contents of the C source code 6 

4.2 Program execution 6 

4.3 Code hierarchy 6 

4.5 Variables, constants and tables 10 

4.5.1 Description of constants used in the C-code 11 

4.5.2 Description of fixed tables used in the C-code 11 

4.5.3 Static variables used in the C-code 13 

5 Homing procedure 16 

6 File formats 16 

6.1 Speech file (encoder input / decoder output) 16 

6.2 Mode control file (encoder input) 16 

6.3 Parameter bitstream file (encoder output / decoder input) 17 

Defauh 3GPP format: 17 

ITU format (activated with command line parameter -itu) 18 

MIME/file storage format (activated with command line parameter -mime) 18 

Annex A (informative): Change history 19 

History 20 



£75/ 



3GPPTS 26.173 version 6.1.0 Release 6 4 ETSI TS 126 173 V6.1.0 (2006-12) 



Foreword 



rd , 



This Technical Specification (TS) has been produced by the 3 Generation Partnership Project (3GPP). 

The contents of the present document are subject to continuing work within the TSG and may change following formal 
TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an 
identifying change of release date and an increase in version number as follows: 

Version x.y.z 

where: 

X the first digit: 

1 presented to TSG for information; 

2 presented to TSG for approval; 

3 or greater indicates TSG approved document under change control. 

y the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, 
updates, etc. 

z the third digit is incremented when editorial only changes have been incorporated in the document. 
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Scope 



The present document contains an electronic copy of the ANSI-C code for the Adaptive Multi-Rate Wideband codec. 
The ANSI-C code is necessary for a bit exact implementation of the Adaptive Multi Rate Wideband speech transcoder 
(3GPP TS 26.190 [2]), Voice Activity Detection (3GPP TS 26.194 [6]), comfort noise (3GPP TS 26.192 [4]), source 
controlled rate operation (3GPP TS 26.193 [5]) and example solutions for substituting and muting of lost frames (3GPP 
TS 26.191 [3]). 



References 



The following documents contain provisions which, through reference in this text, constitute provisions of the present 
document. 

• References are either specific (identified by date of publication, edition number, version number, etc.) or 
non-specific. 

• For a specific reference, subsequent revisions do not apply. 

• For a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including 
a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same 
Release as the present document. 

[1] 3GPP TS 26.174: "AMR Wideband Speech Codec; Test sequences". 

[2] 3GPP TS 26.190: "AMR Wideband Speech Codec; Speech transcoding". 

[3] 3GPP TS 26.191: "AMR Wideband Speech Codec; Substitution and muting of lost frames". 

[4] 3GPP TS 26.192: "AMR Wideband Speech Codec; Comfort noise aspects". 

[5] 3GPP TS 26.193: "AMR Wideband Speech Codec; Source controlled rate operation". 

[6] 3GPP TS 26.194: "AMR Wideband Speech Codec; Voice Activity Detection". 

[7] RFC 3267 'A Real-Time Transport Protocol (RTP) Payload Format and File Storage Format for 

Adaptive Multi-Rate (AMR) and Adaptive Multi-Rate Wideband (AMR-WB) Audio Codecs, June 
2002. 



Definitions and abbreviations 



3.1 Definitions 

Definition of terms used in the present document, can be found in 3GPP TS 26.190 [2], 3GPP TS 26.191 [3], 3GPP 
TS 26.192 [4], 3GPPTS 26.193 [5] and 3GPP TS 26.194 [6]. 

3.2 Abbreviations 

For the purpose of the present document, the following abbreviations apply: 

AMR-WB Adaptive Multi-Rate Wideband 

ANSI American National Standards Institute 

ETS European Telecommunication Standard 

GSM Global System for Mobile communications 

I/O Input/Output 

RAM Random Access Memory 

ROM Read Only Memory 
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C code structure 



This clause gives an overview of the structure of the bit-exact C code and provides an overview of the contents and 
organization of the C code attached to this document. 

The C code has been verified on the following systems: 

Sun Microsystems workstations and GNU gcc compiler 

HP workstations and cc compiler 

IBM PC compatible computers with Windows NT4 operating system and GNU gcc compiler. 
ANSI-C was selected as the programming language because portability was desirable. 

4.1 Contents of the C source code 

The C code distrubution has all files in the root level. 

The distributed files with suffix "c" contain the source code and the files with suffix "h" are the header files. The ROM 
data is contained mostly in files with suffix "tab". 

The C code distribution also contains one speech coder installation verification data file, "spch_dos.inp". The reference 
encoder output file is named "spch_dos.cod", the reference decoder input file is named "spch_dos.dec" and the 
reference decoder output file is named "spch_dos.out". These four files are formatted such that they are correct for an 
IBM PC/AT compatible computer. The same files with reversed byte order of the 16 bit words are named 
"spch_unx.inp", "spch_unx.cod", "spch_unx."dec" and "spch_unx.out", respectively. 

Final verification is to be performed using the GSM Adaptive Multi-Rate Wideband test sequences described in 3GPP 

TS 26.174 [1]. 

Makefiles are provided for the platforms in which the C code has been verified (listed above). Once the software is 
installed, this directory will have a compiled version of encoder and decoder (the bit-exact C executables of the speech 
codec) and all the object files. 

4.2 Program execution 

The GSM Adaptive Multi-Rate Wideband codec is implemented in two programs: 

{encoder) speech encoder; 

{decoder) speech decoder. 
The programs should be called like: 

encoder [encoder options] <speech input file> <parameter file>; 

decoder <parameter file> <speech output file>. 

The speech files contain 16-bit linear encoded PCM speech samples and the parameter files contain encoded speech 
data and some additional flags. 

The encoder and decoder options will be explained by running the applications without input arguments. See the file 
readme.txt for more information on how to run the encoder and decoder programs. 



4.3 Code hierarchy 



Tables 1 to 3 are call graphs that show the functions used in the speech codec, including the functions of VAD, DTX, 
and comfort noise generation. 

Each column represents a call level and each cell a function. The functions contain calls to the functions in rightwards 
neighboring cells. The time order in the call graphs is from the top downwards as the processing of a frame advances. 
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All standard C functions: printf(), fwrite(), etc. have been omitted. Also, no basic operations (add(), L_add(), mac(), 
etc.) or double precision extended operations (e.g. L_Extract()) appear in the graphs. The initialization of the static 
RAM (i.e. calling the _init functions) is also omitted. 

The basic operations are not counted as extending the depth, therefore the deepest level in this software is level 6. 

The encoder call graph is broken down into two separate call graphs. Table 1 to 2. 
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Table 1 : Speech encoder call structure 



coder 


Copy 








Decim_1 2k8 


Down samp 


Interpol (function) 




Copy 






Set zero 






HP50 12k8 




Scale sig 




wb_vad 


Filterbank 


Filters 




Filter3 




Level calculation 




vad_decision 


llog2 




Noise estimate update 


update cntrl | 


hangover addition 




Estimate Speech 






tx dtx handler 






Farm serial 




Autocorr 




Lag window 




Levinson 




Az isp 


Chebps2 




Int isp 


Isp Az 


Get isp pol 




Isp isf 








Gp clip test isf 




Weigtit a 




Residu 




Deemph2 




LP Decim2 




Scale mem Hp wsp 




Pitch_med_ol 


Hp wsp 




Isqrt n 




wb vad tone detection 






Med olag 


medians 




dtx buffer 


Copy 




dtx_enc 


Find frame indices 




Aver isf history 




Qisf_ns 


Sub VQ 




Disf ns 


Reorder isf | 


Farm serial 






Pow2 




Random 




Dot product12 




Isqrt n 




Isf isp 






Isp Az 


Get isp pol 




Synttiesis 


Copy 




Syn flit 32 




Deemph 32 




HP50 12k8 




Random 




Scale sig 




Dot product12 




Isqrt n 




HP400 12k8 




Weight a 




Syn flit 




Flit 6k 7k 




Reset_encoder 


Set zero 




lnit_gp clip 




Init Phase dispersion 


Set zero 




Qpisf_2s_36b 


VQ stagel 






Sub VQ 




Dpisf 2s 36b 


Reorder isf 




Qpisf_2s_46b 


VQ stagel 






Sub VQ 




Dpisf 2s 46b 


Reorder isf 




Syn flit 








Preemph2 




Pitch_fr4 


NormCorr 


Convolve 




Isqrt n 




Interpol 4 






Gp clip 






Pred It4 




Convolve 




G pitcfi 


Dot product12 




Updt tar 






Preempt! 




Pit stirp 




Cor ti X 




ACELP_2t64Jx 


Dot product12 




Isqrt n 




ACELP 4t64 fx 


See Table 2 




Q_gain2 


Dot product12 




Pow2 




Gp clip test gain pit 






voice factor 


Dot_product1 2 
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Table 2: ACELP 4t64 fx call structure 



ACELP_4t64Jx 


Dot product12 








Isqrt n 




cor h vec 




search ixiy 




quant 1p N1 




quant 2p 2N1 




quant_3p_3N1 


quant 2p 2N1 




quant 1p N1 




quant_4p_4N 


quant 4p 4N1 


Quant 2p 2N1 




quant 1p N1 






quant_3p_3N1 


Quant 2p 2N1 




Quant 1p N1 




quant 2p 2N1 






quant_5p_5N 


quant_3p_3N1 


Quant 2p 2N1 




Quant 1p N1 




quant 2p 2N1 






quant_6p_6N_2 


quant_5p_5N 


Quant_3p_3N1 


quant 2p 2N1 


Quant 1p N1 


quant 2p 2N1 




quant 1p N1 






quant„4p_4N 


quant 4p 4N1 


quant 2p 2N1 


quant 1p N1 




quant_3p_3N1 


quant 2p 2N1 


quant 1p N1 


quant 2p 2N1 




quant 2p 2N1 






quant_3p_3N1 


quant 2p 2N1 




Quant_1p_N1 
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Table 3: Speech decoder call structure 



decoder 


Rx dtx handler 










Dtx_dec 


Copy 




Disf ns 


Reorder isf 




Serial_parm 






Pow2 




Random 




Dot product! 2 




Isqrt n 




Serial_parm 






Isf isp 




Isp Az 


Get isp pel 




Copy 






Synthesis 


Copy 




Syn filt 32 




Deemph 32 




HP50 12l<8 




Oversampi 6k 


Copy 




Up samp 


Interpol 




Random 








Scale sig 




Dot product! 2 




Isqrt n 




HP400 12k8 




Isf Extrapolation 


Isf isp 




Isp Az 


Get isp pol 




Weight a 






Syn filt 




Filt 6k 7k 


Copy 




Filt 7k 


Copy 




Reset_decoder 


Set zero 






Init Phase dispersion 


Set zero 




Dpisf 2s 36b 


Reorder isf 






Dpisf 2s 46b 


Reorder isf 




Int isp 


Isp Az 


Get isp pol 




Lagconc 


insertion sort 


Insert 




Random 






Pred It4 






Random 




DEC ACELP 2t64 fx 




DEC_ACELP_4t64Jx 


dec 1p N1 




add pulses 




dec 2p 2N1 




dec_3p_3N1 


Dec 2p 2N! 




dec !p N! 




dec_4p_4N 


dec 4p 4N! 


dec 2p 2N! 




dec !p N! 






Dec_3p_3N! 


Dec 2p 2N! 




Dec !p N! 




Dec 2p 2N! 






dec_5p_5N 


dec_3p_3N! 


Dec 2p 2N! 




Dec !p N! 




Dec 2p 2N! 






dec_6p_6N_2 


Dec_5p_5N 


dec_3p_3N! 


Dec 2p 2N! 


Dec !p N! 


dec 2p 2N! 




dec Ip Nl 






dec_4p_4N 


dec 4p 4N! 


dec 2p 2N! 


dec !p N! 




Dec_3p_3N! 


Dec 2p 2N! 


Dec !p N! 


Dec 2p 2N! 




dec 2p 2N! 






dec_3p_3N! 


Dec 2p 2N! 




Dec !p N! 




Preemph 










Pit shrp 




D_gain2 


Dot product! 2 




Isqrt n 




Medians 




Pow2 




Scale sig 






voice factor 


Dot product! 2 




Phase dispersion 


Set zero 




Agc2 


Isqrt 


Isqrt n 




Set zero 








Dtx_dec_activity_update 


Copy 





4.5 



Variables, constants and tables 



The data types of variables and tables used in the fixed point implementation are signed integers in 2's complement 
representation, defined by: 

- Wordl6 16 bit variable; 

- Word32 32 bit variable. 
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4.5.1 Description of constants used in tine C-code 

This subclause contains a listing of all global constants defined in cnst.h. 

Table 5: Global constants 



Constant 


Value 


Description 


L TOTAL 


384 


total size of speech buffer. 


L WINDOW 


384 


window size in LP analysis 


L NEXT 


64 


Look-ahead size 


L FRAME 


256 


frame size in 12.8 kHz 


L FRAME16k 


320 


frame size in 16 kHz 


L SUBFR 


64 


Subframe size in 12.8 kHz 


L SUBFR16k 


80 


Subframe size in 16 kHz 


NB SUBFR 


4 


Number of subframes 


M16k 


20 


order of LP filter in high-band synthesis in 6.60 mode 


M 


16 


order of LP filter 


L FILT16k 


15 


Delay of down-sampling filter in 1 6 kHz 


L FILT 


12 


Delay of down-sampling filter in 12.8 kHz 


GP CLIP 


15565 


Pitch gain clipping 


PIT SHARP 


27853 


pitch sharpening factor 


PIT MIN 


34 


minimum pitch lag (all modes) 


PIT FR2 


128 


Minimum pitch lag with resolution Vi 


PIT FR1 9b 


160 


Minimum pitch lag with resolution for 9 bit quantization 


PIT FR1 8b 


92 


Minimum pitch lag with resolution for 8 bit quantization 


PIT MAX 


231 


maximum pitch lag 


L INTERPOL 


(16+1) 


length of filter for interpolation 


OPL DECIM 


2 


Decimation in open-loop pitch analysis 


PREEMPH FAC 


22282 


preemphasis factor 


GAMMA1 


30147 


Weighting factor (numerator) 


TILT FAC 


22282 


tilt factor (denominator) 


MAX 


8 


scaling max for signal 


RANDOM INITSEED 


21845 


random init value 


L MEANBUF 


3 


Size of ISF buffer 


ONE PER MEANBUF 


10923 


Inverse of L MEANBUF 



4.5.2 Description of fixed tables used in the C-code 

This section contains a hsting of all fixed tables sorted by source file name and table name. All table data is declared as 
Wordl6. 
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Table 6: Fixed tables 



File 


Table name 


Length 


Description 


C4t64fx.c 


Tipos 


36 


starting points of iterations 


Cod_main.c 


HP_gain 


16 


High band gain table for 23.85 kbit/s mode 


Cod_main.c 


Interpoljrac 


4 


LPC interpolation coefficients 


Cod_main.c 


Ispjnit 


16 


isp tables for initialization 


Cod_main.c 


Isfjnit 


16 


isf tables for initialization 


D_gain2.c 


cdown_unusable 


7 


attenuation factors for codebook gain in lost frames 


D_gain2.c 


cdown_usable 


7 


attenuation factors for codebook gain in bad frames 


D_gain2.c 


pdown_unusable 


7 


attenuation factors for adaptive codebook gain in lost frames 


D_gain2.c 


pdown usable 


7 


attenuation factors for adaptive codebook gain in bad frames 


D_gain2.c 


Pred 


4 


algebraic code book gain MA predictor coefficients 


Dec_main.c 


HP_gain 


16 


High band gain table for 23.85 kbit/s mode 


Dec_main.c 


Interpoljrac 


4 


LPC interpolation coefficients 


Dec_main.c 


Ispjnit 


16 


isp tables for initialization 


Dec_main.c 


Isfjnit 


16 


isf tables for initialization 


Decim54.c 


fir_down 


120 


Downsample FIR filter coefficients 


Decim54.c 


fir_up 


120 


Upsample FIR filter coefficients 


Dtx.c 


en_adjust 


9 


Energy scaling factor for each mode during comfort noise 


GridlOO.tab 


grid 


101 


grid points at wich Chebyshev polynomials 


Ham wind.tab 


Window 


384 


LP analysis window 


Hp400.c 


A 


3 


HP filter coefficients (denominator) in higher band energy estimation 


Hp400.c 


B 


3 


HP filter coefficients (numerator) in higher band energy estimation 


HpSO.c 


A 


3 


HP filter coefficients (denominator) in pre-filtering 


HpSO.c 


B 


3 


HP filter coefficients (numerator) in pre-filtering 


Hp6k.c 


Fir 6k 7k 


31 


Bandpass FIR filter coefficients for higher band generation 


Hp7k.c 


Fir 7k 


31 


Bandpass FIR filter coefficients for higher band in 23.85 kbit/s mode 


Hpwsp.c 


A 


3 


HP filter coefficients (denominator) in open-loop lag gain computation 


Hpwsp.c 


B 


3 


HP filter coefficients (numerator) in open-loop lag gain computation 


Ispjsf.tab 


slope 


128 


table to compute cos(x) in LsfJspO 


Ispjsf.tab 


Table 


129 


table to compute acos(x) in Lspjsf() 


Lag_wind.tab 


lag h 


16 


high part of the lag window table 


Lag_wind.tab 


lagj 


16 


low part of the lag window table 


Lp_dec2.c 


hjir 


5 


HP FIR filter coefficients in open-loop lag search 


Math_op.c 


tablejsqrt 


49 


table used in inverse square root computation 


Math_op.c 


table_pow2 


33 


table used in power of two computation 


P_med_ol.tab 


Corrweight 


199 


weighting of the correlation function in open loop LTP search 


Ph_disp.c 


phjmpjow 


64 


phase dispersion impulse response 


Ph_disp.c 


phjmp_mid 


64 


phase dispersion impulse response 


Pitch fr4.c 


inter4_1 


32 


interpolation filter coefficients 


Pred_lt4.c 


inter4_2 


128 


interpolation filter coefficients 


Q_gain2.c 


pred 


4 


algebraic code book gain MA predictor coefficients 


Q_gain2.tab 


t_qua_gain6b 


2*64 


gain quantization table for 6-bit gain quantization 


Q_gain2.tab 


t_qua_gain7b 


2*128 


gain quantization table for 7-bit gain quantization 


Qisf_ns.tab 


dico1Jsf_noise 


2*64 


1^' ISF quantizer for comfort noise 


Qisf_ns.tab 


dico2Jsf_noise 


3*64 


2"'' ISF quantizer for comfort noise 


Qisf_ns.tab 


Dico3Jsf_noise 


3*64 


3'^^ LSF quantizer for comfort noise 


Qisf_ns.tab 


Dico4Jsf_noise 


4*32 


4* LSF quantizer for comfort noise 


Qisf_ns.tab 


Dico5Jsf_noise 


4*32 


5* LSF quantizer for comfort noise 


Qisf_ns.tab 


meanJsf_noise 


16 


ISF mean for comfort noise 


Qpisf_2s.tab 


dicdjsf 


9*256 


l" ISF quantizer of the l" stage 


Qpisf_2s.tab 


Dico2Jsf 


7*256 


2"" ISF quantizer of the l" stage 


Qpisf_2s.tab 


Dico21Jsf 


3*64 


l" ISF quantizer of the 2""^ stage (not the 6.60 kbit/s mode) 


Qpisf_2s.tab 


Dico21Jsf_36b 


5*128 


l" ISF quantizer of the 2"'^ stage (the 6.60 kbit/s mode) 


Qpisf_2s.tab 


Dico22Jsf 


3*128 


2"" ISF quantizer of the 2"" stage (not the 6.60 kbit/s mode) 


Qpisf 2s.tab 


Dico22 isf 36b 


4*128 


2"" ISF quantizer of the 2"" stage (the 6.60 kbit/s mode) 



(continued) 
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Table 6 (concluded): Fixed tables 



File 


Table name 


Length 


Description 


Qpisf_2s.tab 
Qpisf_2s.tab 
Qpisf_2s.tab 
Qpisf_2s.tab 
Qpisf 2s.tab 


Dico23_isf 
Dico23_isf_36b 
Dico24_isf 
Dico25_isf 
Mean isf 


3*128 
7*64 
3*32 
4*32 
16 


3™ ISF quantizer of the 2™ stage (not the 6.60 kbit/s mode) 
3''' ISF quantizer of the 2"'' stage (the 6.60 l<bit/s mode) 
4* ISF quantizer of the 2"" stage (not the 6.60 l<bit/s mode) 
5* ISF quantizer of the 2"" stage (not the 6.60 l<bit/s mode) 
ISF mean 



4.5.3 Static variables used in tine C-code 

In this section two tables that specify the static variables for the speech encoder and decoder respectively are shown. All 
static variables are declared within a C struct. 
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Table 7: Speech encoder static variables 



Struct name 


Variable 


Type[Length] 

Word16[30] 


Description 


Coder_State 


mem_decim 


Decimation filter memory 




mem_sig_in 


Word16[6] 


Prefilter memory 




mem_preemph 


Word 16 


Preemphasis filter memory 




old_speech 


Word16[128] 


speech buffer 




old_wsp 


Word16[115] 


buffer holding spectral weighted speech 




old_exc 


Word16[248] 


excitation vector 




memjevinson 


Word16[18] 


Levinson memories 




Ispold 


Word16[16] 


Old ISP vector 




ispold_q 


Word16[16] 


Old quantized ISP vector 




past_isfq 


Word16[16] 


past quantized ISF prediction error 




mem_wsp 


Word 16 


Open-loop LIP deemphasis filter memory 




mem_decim2 


Word16[3] 


Open-loop LIP decimation filter memory 




mem_wO 


Word 16 


weighting filter memory (applied to error signal) 




mem_syn 


Word16[16] 


synthesis filter memory 




tilt_code 


Word 16 


Preemhasis filter memory 




old_wsp_max 


Word 16 


Open loop scaling factor 




old wsp shift 


Word 16 


Maximum open loop scaling factor 




Q_old 


Word 16 


Old scaling factor 




Q_max 


Word16[2] 


Maximum scaling factor 




gp_clip 


Word16[2] 


memory of pitch clipping 




qua_gain 


Word16[4] 


Gain quantization memory 




old_TO_med 


Word 16 


weighted open loop pitch lag 




oLgain 


Word 16 


Open-loop gain 




ada_w 


Word 16 


weigthing level depeding on open loop pitch gain 




ol_wght_flg 


Word 16 


switches lag weighting on and off 




old_ol_lag 


Word16[5] 


Open loop lag history 




hp_wsp_mem 


Word16[9] 


Open-loop lag gain filter memory 




old hp wsp 


Word16[243] 


Open-loop lag 




vadSt 


VadVars* 


see below in this table 




dtx_encSt 


dtx encState* 


see below in this table 




first frame 


Word 16 


First frame indicator 




Isfold 


Word16[16] 


Old ISF vector 




L_gc_tlires 


Word 16 


Noise enhancer threshold 




mem_syn_hi 


Word16[16] 


synthesis filter memory (most significant word) 




mem_syn_lo 


Word16[16] 


synthesis filter memory (least significant word) 




mem_deemph 


Word 16 


Deemphasis filter memory 




mem_sig_out 


Word16[6] 


HP filter memory in the synthesis 




mem_hp400 


Word16[6] 


HP filter memory 




mem_oversamp 


Word16[2*12] 


Oversampling filter memory 




mem_syn_hf 


Word16[16] 


Higher band synthesis filter memory 




mem_hf 


Word16[30] 


Estimated BP filter memory (23.85 kbit/s mode) 




mem_hf2 


Word16[30] 


Input BP filter memory (23.85 kbit/s mode) 




mem_hf3 


Word16[30] 


Input LP filter memory (23.85 kbit/s mode) 




seed2 


Word 16 


Random generation seed 




disp_mem 


Word16[8] 


Phase dispersion memory 




vad_hist 


Word 16 


VAD history 




Gainalpfia 


Word 16 


Higher band gain weighting factor (23.85 kbit/s 
mode) 


dtx_encState 


lsf_hist 


Word16[128] 


LSP history (8 frames) 




Log en hist 


Word16[8] 


logarithmic frame energy history (8 frames) 




Hist_ptr 


Word 16 


pointer to the cyclic history vectors 




Log_en_index 


Word 16 


Index for logarithmic energy 




Cng seed 


Word 16 


Comfort noise excitation seed 




D 


Word16[28] 


ISF history distance matrix 




sumD 


Word16[8] 


Sum of ISF history distances 




dtxHangoverCount 


Word 16 


is decreased in DTX hangover period 




decAnaElapsedCount 


Word 16 


counter for elapsed speech frames in DTX 


vadStatel 


bckrest 


Word16[12] 


background noise estimate 




avejevel 


Word16[12] 


averaged input components for stationary estimation 




oldjevel 


Word16[12] 


input levels of the previous frame 




subjevel 


Word16[12] 


input levels calculated at the end of a frame 
(lookahead) 




a_data5 


Word16[5][2] 


memory for the filter bank 




a_data3 


Word16[6] 


memory for the filter bank 




burst_count 


Word 16 


counts length of a speech burst 
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Struct name 


Variable 


Type[Length] 


Description 




Hang_count 


Word 16 


hangover counter 




Stat count 


Word 16 


stationary counter 




Vadreg 


Word 16 


1 5 flags for intermediate VAD decisions 




ToneJIag 


Word 16 


1 5 flags for tone detection 




sp est cnt 


Word 16 


Speech level estimation counter 




Sp max 


Word 16 


Maximum signal level 




sp max cnt 


Word 16 


Maximum level estimation counter 




Speechjevel 


Word 16 


Speech level 




prev_pow_sum 


Word 16 


Power of previous frame 



Table 8: Speech decoder static variables 



Struct name 


Variable 


Type[Length] 


Description 


DecoderState 


old_exc 


Word16[248] 


excitation vector 




ispold 


Word16[16] 


Old ISP vector 




isfold 


Word16[16] 


Old ISF vector 




isf_buf 


Word16[48] 


ISF vector history 




pastjsfq 


Word16[16] 


past quantized ISF prediction error 




tilt code 


Word 16 


Preemhasis filter memory 




Q_old 


Word 16 


Old scaling factor 




Qsubfr 


Word 16 


Scaling factor history 




L_gc_thres 


Word 16 


Noise enhancer threshold 




mem_syn_hi 


Word16[16] 


synthesis filter memory (most significant word) 




mem_syn_lo 


Word16[16] 


synthesis filter memory (least significant word) 




mem_deemph 


Word 16 


Deemphasis filter memory 




mem_sig_out 


Word16[6] 


HP filter memory in the synthesis 




mem_oversamp 


Word16[24] 


Oversampling filter memory 




mem_syn_hf 


Word16[20] 


Higher band synthesis filter memory 




mem_hf 


Word16[30] 


Estimated BP filter memory (23.85 kbit/s mode) 




mem_hf2 


Word16[30] 


Input BP filter memory (23.85 kbit/s mode) 




mem_hf3 


Word16[30] 


Input LP filter memory (23.85 kbit/s mode) 




seed 


Word 16 


Random code generation seed for bad frames 




seed2 


Word 16 


Random generation seed for higher band 




old_TO 


Word 16 


Old LIP lag (integer part) 




old_TO_frac 


Word 16 


Old LIP lag (fraction part) 




lag_hist 


Word16[5] 


LIP lag history 




dec_gain 


Word16[23] 


Gain decoding memory 




seeds 


Word 16 


Random LIP lag generation seed for bad frames 




disp_mem 


Word16[8] 


Phase dispersion memory 




mem_hp400 


Word16[6] 


HP filter memory 




prev_bfi 


Word 16 


Previous BFI 




state 


Word 16 


BGH state machine memory 




firstjrame 


Word 16 


First frame indicator 




dtx decSt 


dtx decState* 


see below in this table 




Vad hist 


Word 16 


VAD history 


dtx_decState 


Since_last_sid 


Word 16 


number of frames since last SID frame 




true_sid_period_inv 


Word 16 


inverse of true SID update rate 




log_en 


Word 16 


logarithmic frame energy 




old log en 


Word 16 


previous value of log_en 




isf 


Word16[16] 


ISF vector 




lsf_old 


Word16[16] 


Previous ISF vector 




Cng seed 


Word 16 


Comfort noise excitation seed 




lsf_hist 


Word16[128] 


ISF vector history (8 frames) 




Log en hist 


Word16[8] 


logarithmic frame energy history 




Hist_ptr 


Word 16 


index to beginning of LSF history 




dtxHangoverCount 


Word 16 


counts down in hangover period 




DecAnaElapsedCount 


Word 16 


counts elapsed speech frames after DTX 




sidjrame 


Word 16 


flags SID frames 




valid_data 


Word 16 


flags SID frames containing valid data 




log_en_adjust 


Word 16 


mode-dependent frame energy adjustment 




dtxHangoverAdded 


Word 16 


flags hangover period at end of speech 




dtxGlobalState 


Word 16 


DTX state flags 




data_updated 


Word 16 


flags CNI updates 
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Homing procedure 



The principles of the homing procedures are described in [2]. This specification only includes a detailed description of 
the 9 decoder homing frames. For each AMR-WB codec mode, the corresponding decoder homing frame has a fixed set 
of parameters. The parameters in serial format are packed into parameters in 15-bit-long format where the first serial bit 
is inserted into most significant bit in the 15-bit-long format. These 15-bit-long parameters do not represent real speech 
parameters, but they decrease memory consumption compared to the speech parameters. Table 9 shows the homing 
frame in 15-bit-long format for different modes. In the decoder, the received speech parameters in serial format are first 
converted into 15-bit-long format. Then the obtained parameters are compared against the homing frame table values 
(Table 9). 



Table 9: Table values for the decoder homing frame in 15-bit-long format for different modes 



Mode 



Value (MSB=bO) 



3168, 29954, 29213, 16121, 64, 13440, 30624, 16430, 19008 

3168, 31665, 9943, 9123, 15599, 4358, 20248, 2048, 17040, 27787, 16816, 13888 

3168, 31665, 9943, 9128, 3647, 8129, 30930, 27926, 18880, 12319, 496, 1042, 4061, 20446, 25629, 

28069, 13948 

3168, 31665, 9943, 9131, 24815, 655, 26616, 26764, 7238, 19136, 6144, 88, 4158, 25733, 30567, 30494, 

221,20321, 17823 

3168, 31665, 9943, 9131, 24815, 700, 3824, 7271, 26400, 9528, 6594, 26112, 108, 2068, 12867, 16317, 

23035, 24632, 7528, 1752, 6759, 24576 

3168, 31665, 9943, 9135, 14787, 14423, 30477, 24927, 25345, 30154, 916, 5728, 18978, 2048, 528, 

16449, 2436, 3581, 23527, 29479, 8237, 16810, 27091, 19052, 

3168, 31665, 9943, 9129, 8637, 31807, 24646, 736, 28643, 2977, 2566, 25564, 12930, 13960, 2048, 834, 
3270, 4100, 26920, 16237, 31227, 17667, 15059, 20589, 30249, 29123, 

3168, 31665, 9943, 9132, 16748, 3202, 28179, 16317, 30590, 15857, 19960, 8818, 21711, 21538, 4260, 

16690, 20224, 3666, 4194, 9497, 16320, 15388, 5755, 31551, 14080, 3574, 15932, 50, 23392, 26053, 

31216 

3168, 31665, 9943, 9134, 24776, 5857, 18475, 28535, 29662, 14321, 16725, 4396, 29353, 10003, 17068, 

20504, 720, 0, 8465, 12581, 28863, 24774, 9709, 26043, 7941, 27649, 13965, 15236, 18026, 22047, 

16681, 3968 



6 File formats 

This section describes the file formats used by the encoder and decoder programs. The test sequences defined in [1 also 
use the file formats described here. 

6.1 Speech file (encoder input / decoder output) 

Speech files read by the encoder and written by the decoder consist of 16-bit words where each word contains a 14-bit, 
left aligned speech sample. The byte order depends on the host architecture (e.g. MSByte first on SUN workstations, 
LSByte first on PCs etc.). Both the encoder and the decoder program process complete frames (of 320 samples) only. 

This means that the encoder will only process n frames if the length of the input file is n'*320 + k words, while the files 
produced by the decoder will always have a length of n*320 words. 

6.2 Mode control file (encoder input) 

The encoder program can optionally read in a mode control file which specifies the encoding mode for each frame of 
speech processed. The file is a text file containing one number per speech frame. Each line contains one of the mode 
numbers 0-8. 
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6.3 Parameter bitstream file (encoder output / decoder input) 

The files produced by the speech encoder/expected by the speech decoder contain an arbitrary number of fi^ames in the 
following available formats. 

NOTE ON DEFAULT 3GPP AND ITU BITSTREAM FORMATS: 

ITU stream format gives very limited possibilities to distinguish NO_DATA and SID_FIRST frame types at the 
beginning of a stream. In some very limited cases for which some instance between encoder and decoder cuts of the 
first hangover period frames (e.g. handovers, editing of the stream), the output of the decoder is different depending on 
the stream format, ITU or default 3GPP. 

Default 3GPP format: 

This is the default format used in 3GPP. This format shall be used when the codec is tested against the test vectors. 



T YP E_OF_FRAME_T YP E 


FRAME_TYPE 


MODE 


Bl 


B2 




Bnn 



Each box corresponds to one Wordl 6 value in the bitstream file, for a total of 3+nn words or 6+2nn bytes per frame, 
where nn is the number of encoded bits in the frame. Each encoded bit is represented as follows: Bit = OxffSl, Bit 1 
0x007f The fields have the following meaning: 



TYPE_OF_FRAME_TYPE transmit 

TX_TYPE 
RX TYPE 



frame type, 

(0x6b21) 
(0x6b20) 



which 



IS 



one 



of 



If TYPE OF FRAME TYPE is TX TYPE, 



FRAME_TYPE transmit frame type, 

TX_SPEECH (0x0000) 

TX_SID_FIRST (0x0001) 

TX_SID_UPDATE (0x0002) 

TX NO DATA (0x0003) 



which 



IS 



one 



of 



If TYPE_OF_FRAME_TYPE is RX_TYPE, 

FRAME_TYPE transmit frame type, which 

RX_SPEECH_GOOD (0x0000) 

RX SPEECH PROBABLY DEGRADED (0x0001) 



IS 



one 



of 



_SPEECH_LOST 


0x0002 


_SPEECH_BAD 


0x0003 


_SID_FIRST 


0x0004 


_SID_UPDATE 


0x0005 


_SID_BAD 


0x0006 


_NO_DATA 


0x0007 



B0...B2nn 



speech encoder parameter bits (i.e. the bitstream itself). Each bx either has the 

value 0x0081 (for bit 0) orOxOOTF (for bit 1). 



MODE INFO 



encoding 

6.60 
8.85 



mode 



kbit/s mode 

kbit/s mode 

12.65 kbit/s mode 

14.25 kbit/s mode 

15.85 kbit/s mode 

18.25 kbit/s mode 

19.85 kbit/s mode 

23.05 kbit/s mode 

23.85 kbit/s mode 



information, 

(0x0000) 
(0x0001) 
(0x0002) 
(0x0003) 
(0x0004) 
(0x0005) 
(0x0006) 
(0x0007) 
(0x0008) 



which 



IS 



one 



of 



As indicated in section 6.1 above, the byte order depends on the host architecture. 
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ITU format (activated with command line parameter -itu) 



SYNC_WORD 


DATA_LENGTH 


Bl 


B2 




Bnn 



Each box corresponds to one Wordl 6 value in the bitstream file, for a total of 2+nn words or 4+2nn bytes per frame, 
where nn is the number of encoded bits in the frame. Each encoded bit is represented as follows: Bit = 0x007f, Bit 1 = 
0x0081. The fields have the following meaning: 



SYNC_WORD 



Word to ensure correct frame synchronization between tine encoder and tine 
decoder. It is also used to indicate the occurrences of bad frames. 



DATA_LENGTH 



In the encoder output: 
In the decoder input: 



(0x6b21) 

Good frames (0x6b21) 
Bad frames (0x6b20) 



Length of the speech data. Codec mode and frame type is extracted in the 
decoder using this parameter: 



DATA 








_LENGTH 


PREVIOUS FRAME 


CODEC MODE 


FRAMETYPE 





RX_SPEECH_GOOD/ 
RX_SPEECH_BAD 


DTX 


RX_SID_FIRST 





OTHER THAN 

RX_SPEECH_GOOD/ 
RX_SPEECH_BAD 


DTX 


RX_NO_DATA 


35 


~ 


DTX 


RX_SID_UPDATE/ 
RX_SID_BAD 


132 


~ 


6.60 kbit/s 


RX_SPEECH_GOOD/ 
RX_SPEECH_BAD 


177 


~ 


8.85 kbit/s 


RX_SPEECH_GOOD/ 
RX_SPEECH_BAD 


253 


~ 


12.65 kbit/s 


RX_SPEECH_GOOD/ 
RX_SPEECH_BAD 


285 


~ 


14.25 kbit/s 


RX_SPEECH_GOOD/ 
RX_SPEECH_BAD 


317 


~ 


15.85 kbit/s 


RX_SPEECH_GOOD/ 
RX_SPEECH_BAD 


365 


~ 


18.25 kbit/s 


RX_SPEECH_GOOD/ 
RX_SPEECH_BAD 


397 


~ 


19.85 kbit/s 


RX_SPEECH_GOOD/ 
RX_SPEECH_BAD 


461 


~ 


23.05 kbit/s 


RX_SPEECH_GOOD/ 
RX_SPEECH_BAD 


477 


~ 


23.85 kbit/s 


RX_SPEECH_GOOD/ 
RX_SPEECH_BAD 



MIME/file storage format (activated with command line parameter -mime) 

Detailed description of the AMR-WB single channel MIME/file storage format can be found in [7] (sections 5.1 and 
5.3). This format is used e.g. by the Multimedia Messaging Service (MMS). 
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Annex A (informative): 
Change history 



Change history 


Date 


TSG# 


TSG Doc. 


CR 


Rev 


Subject/Comment 


Old 


New 


03-2001 


11 


SP-010083 






Version 2.0.0 provided for approval 




5.0.0 


06-2001 


12 


SP-010307 


001 




Unnecessary printing in Az isp-function 


5.0.0 


5.1.0 


06-2001 


12 


SP-010307 


002 




Overflow in isp az.c 


5.0.0 


5.1.0 


06-2001 


12 


SP-010307 


003 




Error in the ISP extrapolation in 6.60 kbit/s mode 


5.0.0 


5.1.0 


06-2001 


12 


SP-010307 


004 




14-bit masking to decoder 


5.0.0 


5.1.0 


06-2001 


12 


SP-010307 


005 




Correction of the homing function 


5.0.0 


5.1.0 


06-2001 


12 


SP-010307 


006 




Fixed codebook initialisation 


5.0.0 


5.1.0 


06-2001 










IVIinor editorial to cover page 


5.1.0 


5.1.1 


09-2001 


13 


SP-010455 


007 




Error in the C-code of the encoder homing function 


5.1.1 


5.2.0 


09-2001 


13 


SP-010455 


008 




Inconsistency in the file format description 


5.1.1 


5.2.0 


12-2001 


14 


SP-010699 


009 




Incorrect mode usage during DTX 


5.2.0 


5.3.0 


12-2001 


14 


SP-010699 


010 




Correction of decoder homing function for 23.85 kbit/s 
mode 


5.2.0 


5.3.0 


03-2002 


15 


SP-020081 


Oil 


2 


Correction of mode reading and memory usage 


5.3.0 


5.4.0 


03-2002 


15 


SP-020081 


012 




Correction of pitch calculation of AIVIR-WB encoder 


5.3.0 


5.4.0 


03-2002 


15 


SP-020081 


013 




Error concealment of high band gain in 23.85 kbit/s mode 


5.3.0 


5.4.0 


1 2-2002 


18 


SP-020692 


014 




Correction of ambiguous expression in the AMR-WB C- 
Code 


5.4.0 


5.5.0 


03-2003 


19 


SP-030089 


015 


2 


Harmonization of 3GPP TS 26.1 73 and ITU-T G.722.2 C- 
codes 


5.5.0 


5.6.0 


03-2003 


19 


SP-030089 


016 




Correction for handling of RX NO DATA frames 


5.5.0 


5.6.0 


06-2003 


20 


SP-030216 


017 


1 


IVIMS compatible input/output option for fixed-point AMR- 
WB source code 


5.6.0 


5.7.0 










Added file containing the C-code accidentally omitted 
from previous version 


5.7.0 


5.7.1 


09-2003 


21 


SP-030446 


019 




Possible decoder LPC coefficients overflow 


5.7.1 


5.8.0 


1 2-2004 


26 


SP-040844 


020 


1 


Incorrect definition of vector nb of bits 


5.8.0 


6.0.0 


1 2-2006 


34 


SP-060846 


0023 


1 


Correction to bug in ITU-T bitstream format in the 
presence of frame erasures 


6.0.0 


6.1.0 
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